PlantTFDB
Plant Transcription Factor Database
v4.0
Previous version: v3.0
Transcription Factor Information
Basic Information | Signature Domain | Sequence | 
Basic Information? help Back to Top
TF ID Glyma.20G178500.1.p
Common NameGLYMA_20G178500, LOC100810588
Organism
Taxonomic ID
Taxonomic Lineage
cellular organisms; Eukaryota; Viridiplantae; Streptophyta; Streptophytina; Embryophyta; Tracheophyta; Euphyllophyta; Spermatophyta; Magnoliophyta; Mesangiospermae; eudicotyledons; Gunneridae; Pentapetalae; rosids; fabids; Fabales; Fabaceae; Papilionoideae; Phaseoleae; Glycine; Soja
Family MYB
Protein Properties Length: 1665aa    MW: 181491 Da    PI: 5.6045
Description MYB family protein
Gene Model
Gene Model ID Type Source Coding Sequence
Glyma.20G178500.1.pgenomeJGIView CDS
Signature Domain? help Back to Top
Signature Domain
No. Domain Score E-value Start End HMM Start HMM End
1Myb_DNA-binding285e-09785826346
                          SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHHHH CS
      Myb_DNA-binding   3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrwqk 46 
                          +WT+eE e +++ ++ +G++ +++Ia+ +  ++t+ +c+++++k
  Glyma.20G178500.1.p 785 PWTPEEREVFLEKFAAFGKD-FRKIASFLD-HKTAADCVEFYYK 826
                          8*****************99.*********.***********98 PP

2Myb_DNA-binding33.78.6e-119731012344
                           SS-HHHHHHHHHHHHHTTTT-HHHHHHHHTTTS-HHHHHHHH CS
      Myb_DNA-binding    3 rWTteEdellvdavkqlGggtWktIartmgkgRtlkqcksrw 44  
                            WT +E   +++av  +G++ +++Iar++g +R+ +qck ++
  Glyma.20G178500.1.p  973 DWTDDEKTAFLQAVSSFGKD-FAKIARCVG-TRSQEQCKVFF 1012
                           5*****************99.*********.********766 PP

Protein Features ? help Back to Top
3D Structure
Database Entry ID E-value Start End InterPro ID Description
SuperFamilySSF466893.51E-14769829IPR009057Homeodomain-like
PROSITE profilePS5129316.282781832IPR017884SANT domain
SMARTSM007171.4E-9782830IPR001005SANT/Myb domain
PfamPF002491.0E-6784826IPR001005SANT/Myb domain
CDDcd001671.60E-7785827No hitNo description
Gene3DG3DSA:1.10.10.604.9E-6785826IPR009057Homeodomain-like
PROSITE profilePS5129312.839691020IPR017884SANT domain
SMARTSM007172.2E-89701018IPR001005SANT/Myb domain
SuperFamilySSF466896.47E-109711020IPR009057Homeodomain-like
PfamPF002495.9E-99731012IPR001005SANT/Myb domain
Gene3DG3DSA:1.10.10.602.5E-69731012IPR009057Homeodomain-like
CDDcd001671.10E-79741012No hitNo description
Gene Ontology ? help Back to Top
GO Term GO Category GO Description
GO:0005634Cellular Componentnucleus
GO:0003677Molecular FunctionDNA binding
Sequence ? help Back to Top
Protein Sequence    Length: 1665 aa     Download sequence    Send to blast
MPPEPLPWDR KDFFKERKHE RSESLGSVAR WRDSSHHRDF NRWGSAEFRR PPGHGKQGGW  60
HLFSEESGHG YAISRSSSDK MLEDDSRPSF SRGDGKYGRS SRENRGGPFG QRDWRGHSWE  120
PSNGSISFPR RQQDVNNDHR SIDDALAYSP HPHSDFGNAW DQHHLKDQHD KMGGVNDFGA  180
GPRCDRENSL GDWKPLKWTR SGSLSSRGSG FSHSSSSRSM GGADSHEAKA ELLPKSVAVN  240
ESHSGEAAAC ATSSVPSEDT TSRKKPRLGW GEGLAKYEKK KVEVPEASAN KDGPVLSTSN  300
TEPCNLLSPS LVDKSPKVIG FSECASPATP SSVACSSSPG MDDKLFGKTA NVDNDVSNLT  360
GSPAPVSENH FARFSFNLEK FDIDSLNNLG SSIIELVQSD DPTSLDSGPM RSNAINKLLI  420
WKADISKVLE MTESEIDLLE NELKSLKSES GETCPCSCPV ALGSQMVGGD EKYGEEHVGV  480
SDQVIRPLPL KVVDDPNTEK MPLSTNLHSI HENGKEEDID SPGTATSKFV EPLPLIKAVS  540
CDTRGYDNFS RDLDAVQSTA VKCLVPCTTR KEASVSTFVD GNTSMALKDS MDILYKTIIS  600
SNKESANRAS EVFDKLLPKD CCKIEKMEAS SDTCTHTFIM EKFAEKKRFA RFKERVIALK  660
FRALHHLWKE DMRLLSIRKC RPKSHKKNEL SVRSTCNGIQ KNRLSIRSRF PFPAGNQLSL  720
VPTSEIINFT SKLLSESQVK VQSNTLKMPA LILDEKEKMI SKFVSSNGLV EDPLAIEKER  780
AMINPWTPEE REVFLEKFAA FGKDFRKIAS FLDHKTAADC VEFYYKNHKS DCFEKIKKQD  840
GCKLGKSYSA KTDLIASGNK KLRTGSSLLG GYGKVKTSRG EDFIEKSSSF DILGDERETA  900
AAADVLAGIC GSLSSEAMSS CITSSVDPVE GNRDRKFLKV NPLCKPPMTP DVTQDVDDET  960
CSDESCGEMD PTDWTDDEKT AFLQAVSSFG KDFAKIARCV GTRSQEQCKV FFSKGRKCLG  1020
LDLMRPIPEN VGSPVNDDAN GGESDTDDAC VVETGSVVGT DKSGTKTDED LPLYGTNTYH  1080
DESHPVEARN LSAELNESKE IIGTEVDLED ANVTSGAYQI NIDSELGCDG SEVFLCVSNK  1140
SGSVGEQAGI IMSDSTEVGK DKANKLGGAA TELISAPDSS EPCESNSVAE DRMVVSEVSS  1200
GGLGNELERY RVSATLCVDD RDNKYEADSG VIVDLKSSVH DLSTMVNSSL SSLGTSCSGL  1260
SFCSENKHVP LGKPHVSALS MDDLLATSNS LLQNTVAVDV QCEKTASQDQ MSSTCDIQGG  1320
RDMHCQNSIS NAGHQLPITG NLSDHVDAVS ILQGYPFQVP LKKEMNGDMN CSSSATELPF  1380
LPHKIEQDDD HIKTFQSSDS DKTSRNGDVK LFGKILTNPS TTQKPNVGAK GSEENGTHHP  1440
KLSSKSSNLK FTGHHSADGN LKILKFDHND YVGLENVLEN VPMRSYGYWD GNRIQTGLST  1500
LPDSAILLAK YPAAFSNYPT SSAKLEQPSL QTYSKNNERL LNGAPTLTTT RDINGSNAVI  1560
DYQLFRRDGP KVQPFMVDVK HCQDVFSEMQ RRNGFEAISS LQQQSRGVMG MNGVGRPGIL  1620
VGGSCSGVSD PVAAIKMHYS NSDKYGGQTG SIAREDESWG GKGD*
3D Structure ? help Back to Top
Structure
PDB ID Evalue Query Start Query End Hit Start Hit End Description
4a69_C5e-17748834994NUCLEAR RECEPTOR COREPRESSOR 2
4a69_D5e-17748834994NUCLEAR RECEPTOR COREPRESSOR 2
Search in ModeBase
Expression -- UniGene ? help Back to Top
UniGene ID E-value Expressed in
Gma.66190.0cotyledon| flower| root
Regulation -- PlantRegMap ? help Back to Top
Source Upstream Regulator Target Gene
PlantRegMapRetrieve-
Annotation -- Protein ? help Back to Top
Source Hit ID E-value Description
RefseqXP_006606235.10.0PREDICTED: uncharacterized protein LOC100810588 isoform X5
TrEMBLK7N4570.0K7N457_SOYBN; Uncharacterized protein
STRINGGLYMA20G31871.10.0(Glycine max)
Orthologous Group ? help Back to Top
LineageOrthologous Group IDTaxa NumberGene Number
FabidsOGEF49863352
Representative plantOGRP32151725
Best hit in Arabidopsis thaliana ? help Back to Top
Hit ID E-value Description
AT3G52250.10.0MYB family protein